Distributed application optimization using service groups

ABSTRACT

Embodiments are directed to managing multiple different types of applications using service groups. In one scenario, a computer system receives an indication of one or more application dependencies and characteristics that are to be implemented when an application is provisioned on a distributed host computer system. The computer system creates an application manifest that declaratively defines application dependencies and characteristics for various different service groups. Each service group includes applications that match the declaratively defined application dependencies and characteristics. The computer system also sends the manifest to the distributed host computer system which loads those applications that fit the manifest criteria onto available nodes of the distributed host computer system according to the service groups specified in the manifest.

BACKGROUND

Computers have become highly integrated in the workforce, in the home, in mobile devices, and many other places. Computers can process massive amounts of information quickly and efficiently. Software applications designed to run on computer systems allow users to perform a wide variety of functions including business applications, schoolwork, entertainment and more. Software applications are often designed to perform specific tasks, such as word processor applications for drafting documents, or email programs for sending, receiving and organizing email.

In some cases, software applications are hosted on large, distributed computing systems such as cloud computing systems. These applications are provided to large numbers of users, spread over wide geographic areas. The applications (or instances thereof) may be run on various different physical and/or virtual computer systems located in geographically-dispersed areas. Managing these distributed applications can require a substantial effort.

BRIEF SUMMARY

Embodiments described herein are directed to managing multiple different types of applications using service groups. In one embodiment, a computer system receives an indication of one or more application dependencies and characteristics that are to be implemented when an application is provisioned on a distributed host computer system. The computer system creates an application manifest that declaratively defines application dependencies and characteristics for various different service groups. Each service group includes applications that match the declaratively defined application dependencies and characteristics. The computer system also sends the manifest to the distributed host computer system which loads those applications that fit the manifest criteria onto available nodes of the distributed host computer system according to the service groups specified in the manifest.

In another embodiment, a computer system receives an application manifest that declaratively defines application dependencies and characteristics for different service groups, so that each service group includes applications that match the declaratively defined application dependencies and characteristics. The computer system determines which applications are to be loaded in a specified service group and loads those applications that fit the manifest criteria for the specified service group on a single node of the distributed computer system according to the manifest, so that those applications that fit the manifest criteria for the specified service group are co-located. Co-location allows applications to failover and migrate together.

This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.

Additional features and advantages will be set forth in the description which follows, and in part will be apparent to one of ordinary skill in the art from the description, or may be learned by the practice of the teachings herein. Features and advantages of embodiments described herein may be realized and obtained by means of the instruments and combinations particularly pointed out in the appended claims. Features of the embodiments described herein will become more fully apparent from the following description and appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

To further clarify the above and other features of the embodiments described herein, a more particular description will be rendered by reference to the appended drawings. It is appreciated that these drawings depict only examples of the embodiments described herein and are therefore not to be considered limiting of its scope. The embodiments will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:

FIG. 1 illustrates a computer architecture in which embodiments described herein may operate including managing multiple different types of applications using service groups.

FIG. 2 illustrates a flowchart of an example method for managing multiple different types of applications using service groups.

FIG. 3 illustrates a flowchart of an alternative example method for managing multiple different types of applications using service groups.

FIG. 4 illustrates an embodiment in which a distributed host computer system hosts multiple applications on different computing nodes.

DETAILED DESCRIPTION

Embodiments described herein are directed to managing multiple different types of applications using service groups. In one embodiment, a computer system receives an indication of one or more application dependencies and characteristics that are to be implemented when an application is provisioned on a distributed host computer system. The computer system creates an application manifest that declaratively defines application dependencies and characteristics for various different service groups. Each service group includes applications that match the declaratively defined application dependencies and characteristics. The computer system also sends the manifest to the distributed host computer system which loads those applications that fit the manifest criteria onto available nodes of the distributed host computer system according to the service groups specified in the manifest.

In another embodiment, a computer system receives an application manifest that declaratively defines application dependencies and characteristics for different service groups, so that each service group includes applications that match the declaratively defined application dependencies and characteristics. The computer system determines which applications are to be loaded in a specified service group and loads those applications that fit the manifest criteria for the specified service group on a single node of the distributed computer system according to the manifest, so that those applications that fit the manifest criteria for the specified service group are co-located. Co-location allows applications to failover and migrate together.

The following discussion now refers to a number of methods and method acts that may be performed. It should be noted, that although the method acts may be discussed in a certain order or illustrated in a flow chart as occurring in a particular order, no particular ordering is necessarily required unless specifically stated, or required because an act is dependent on another act being completed prior to the act being performed.

Embodiments described herein may comprise or utilize a special purpose or general-purpose computer including computer hardware, such as, for example, one or more processors and system memory, as discussed in greater detail below. Embodiments described herein also include physical and other computer-readable media for carrying or storing computer-executable instructions and/or data structures. Such computer-readable media can be any available media that can be accessed by a general purpose or special purpose computer system. Computer-readable media that store computer-executable instructions in the form of data are computer storage media. Computer-readable media that carry computer-executable instructions are transmission media. Thus, by way of example, and not limitation, embodiments described herein can comprise at least two distinctly different kinds of computer-readable media: computer storage media and transmission media.

Computer storage media includes RAM, ROM, EEPROM, CD-ROM, solid state drives (SSDs) that are based on RAM, Flash memory, phase-change memory (PCM), or other types of memory, or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store desired program code means in the form of computer-executable instructions, data or data structures and which can be accessed by a general purpose or special purpose computer.

A “network” is defined as one or more data links and/or data switches that enable the transport of electronic data between computer systems and/or modules and/or other electronic devices. When information is transferred or provided over a network (either hardwired, wireless, or a combination of hardwired or wireless) to a computer, the computer properly views the connection as a transmission medium. Transmission media can include a network which can be used to carry data or desired program code means in the form of computer-executable instructions or in the form of data structures and which can be accessed by a general purpose or special purpose computer. Combinations of the above should also be included within the scope of computer-readable media.

Further, upon reaching various computer system components, program code means in the form of computer-executable instructions or data structures can be transferred automatically from transmission media to computer storage media (or vice versa). For example, computer-executable instructions or data structures received over a network or data link can be buffered in RAM within a network interface module (e.g., a network interface card or “NIC”), and then eventually transferred to computer system RAM and/or to less volatile computer storage media at a computer system. Thus, it should be understood that computer storage media can be included in computer system components that also (or even primarily) utilize transmission media.

Computer-executable (or computer-interpretable) instructions comprise, for example, instructions which cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions. The computer executable instructions may be, for example, binaries, intermediate format instructions such as assembly language, or even source code. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the described features or acts described above. Rather, the described features and acts are disclosed as example forms of implementing the claims.

Those skilled in the art will appreciate that various embodiments may be practiced in network computing environments with many types of computer system configurations, including personal computers, desktop computers, laptop computers, message processors, hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, mobile telephones, PDAs, tablets, pagers, routers, switches, and the like. Embodiments described herein may also be practiced in distributed system environments where local and remote computer systems that are linked (either by hardwired data links, wireless data links, or by a combination of hardwired and wireless data links) through a network, each perform tasks (e.g. cloud computing, cloud services and the like). In a distributed system environment, program modules may be located in both local and remote memory storage devices.

In this description and the following claims, “cloud computing” is defined as a model for enabling on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services). The definition of “cloud computing” is not limited to any of the other numerous advantages that can be obtained from such a model when properly deployed.

For instance, cloud computing is currently employed in the marketplace so as to offer ubiquitous and convenient on-demand access to the shared pool of configurable computing resources. Furthermore, the shared pool of configurable computing resources can be rapidly provisioned via virtualization and released with low management effort or service provider interaction, and then scaled accordingly.

A cloud computing model can be composed of various characteristics such as on-demand self-service, broad network access, resource pooling, rapid elasticity, measured service, and so forth. A cloud computing model may also come in the form of various service models such as, for example, Software as a Service (“SaaS”), Platform as a Service (“PaaS”), and Infrastructure as a Service (“IaaS”). The cloud computing model may also be deployed using different deployment models such as private cloud, community cloud, public cloud, hybrid cloud, and so forth. In this description and in the claims, a “cloud computing environment” is an environment in which cloud computing is employed.

Additionally or alternatively, the functionally described herein can be performed, at least in part, by one or more hardware logic components. For example, and without limitation, illustrative types of hardware logic components that can be used include Field-programmable Gate Arrays (FPGAs), Program-specific Integrated Circuits (ASICs), Program-specific Standard Products (ASSPs), System-on-a-chip systems (SOCs), Complex Programmable Logic Devices (CPLDs), and other types of programmable hardware.

Still further, system architectures described herein can include a plurality of independent components that each contribute to the functionality of the system as a whole. This modularity allows for increased flexibility when approaching issues of platform scalability and, to this end, provides a variety of advantages. System complexity and growth can be managed more easily through the use of smaller-scale parts with limited functional scope. Platform fault tolerance is enhanced through the use of these loosely coupled modules. Individual components can be grown incrementally as business needs dictate. Modular development also translates to decreased time to market for new functionality. New functionality can be added or subtracted without impacting the core system.

FIG. 1 illustrates a computer architecture 100 in which at least one embodiment may be employed. Computer architecture 100 includes computer system 101. Computer system 101 may be any type of local or distributed computer system, including a cloud computing system. The computer system includes various modules for performing a variety of different functions. For instance, an application manifest generating module 110 may be used to generate an application manifest from application dependencies 106 and characteristics 107 provided by a user 105. The application manifest 115 may include descriptions or declarative definitions of applications that are to be included in a given service group. For instance, if an application needs certain computing power guarantees, those needs can be indicated in the application characteristics 107. If an application is dependent on another application or service, those dependencies can be indicated in the application dependencies 106. Many other dependencies and/or characteristics may be indicated by the user 105.

Service groups, as referred to herein, are groups of applications or services that meet one or more criteria including dependencies 106 and/or characteristics 107. Service groups may be implemented to provide computing performance and scale capabilities to hosted applications or services. Services within a service group may be unaware that they (or other services) are part of a service group (or not part of a service group). As such, a programming model or service model for a service will be the same whether it is part of a service group or not. This allows composition of related services (e.g. related in application logic), as well as unrelated services in the same service group. Service groups may accommodate grouping of applications or services written in native code, managed code or both native and managed code services. The service host of the service group (e.g. distributed host computer system 130) can be native as well as managed.

When service-group-specific features and capabilities are accessed by services within the service group (e.g. applications 120A within service group 116A), the service model is explicit. In such cases, functionality is available only to services in that service group (and not regular services) through explicit APIs and constructs. The application developer (e.g. user 105) may be aware of the service groupings. Services in the service group can be of the same or different service types. It should be noted that although the terms “application” and “service” both refer to portions of executable software code, an application may include one or more service groups and one or more services. A service model may be used to express the model of grouped services, and may further be used to manage the service groups.

Services that are part of the same service group may be grouped together on the same physical hardware (e.g. on node A (131A) of the distributed host computer system 130. Such services are said to be “co-located”. Co-located services are load balanced together, and also failover together if there is a hardware failure. Thus, if a particular node (e.g. 131B or 131C) becomes available to handle some of node 131A's load for example, co-located services running on node 131A will be moved together from 131A to 131B or 131C. Similarly, if two co-located services are running on node 131A and it fails (for whatever reason), the two co-located services may be migrated to node 131B or 131C together. This failover or load balancing may occur automatically, as managed by the distributed host computer system. Although the above example describes co-locating two services, it should be understood that substantially any number of services or applications may be co-located. Load balancing properties (placement constraints, load utilization reporting, etc.) may be applied at the service group level. Each service member in the service group will report its load metrics, and the service group will perform appropriate aggregation.

Service groups can run in any node of the distributed host computer system 130. A service group with a different instance can be activated in the same or a different host node. Different service groups can be activated in the same host. Services in the service group can provide their own binaries. Both service groups and services within the service group are named entities. Using naming client APIs, a client can resolve the service group name, and the name of any service within that service group. Moreover, a service within the service group can resolve and obtain access (via a service endpoint or interface pointer) to any other service within that same service group for efficient inter-service communication. The interface pointer access can also be used for efficient inter-service communication within the same service host process. These concepts will be explained further below with regard to methods 200 and 300 of FIGS. 2 and 3, respectively.

In view of the systems and architectures described above, methodologies that may be implemented in accordance with the disclosed subject matter will be better appreciated with reference to the flow charts of FIGS. 2 and 3. For purposes of simplicity of explanation, the methodologies are shown and described as a series of blocks. However, it should be understood and appreciated that the claimed subject matter is not limited by the order of the blocks, as some blocks may occur in different orders and/or concurrently with other blocks from what is depicted and described herein. Moreover, not all illustrated blocks may be required to implement the methodologies described hereinafter.

FIG. 2 illustrates a flowchart of a method 200 for managing multiple different types of applications using service groups. The method 200 will now be described with frequent reference to the components and data of environment 100.

Method 200 includes an act of receiving an indication of one or more application dependencies and characteristics that are to be implemented when an application is provisioned on a distributed host computer system (act 210). For example, application manifest generating module 110 may receive application dependencies 106 and/or application characteristics 107 that are to be implemented when an application (e.g. one of applications 120A) is provisioned on distributed host computer system 130. The distributed host computer system 130 may be any type of distributed or cloud computing system and may include substantially any number of computing nodes, each node having a variable number of processors, threads, virtual computer systems or other hardware or logical components.

Method 200 includes an act of creating an application manifest that declaratively defines application dependencies and characteristics for each of a plurality of different service groups, such that each service group includes applications that match the declaratively defined application dependencies and characteristics (act 220). The applications of a service group (e.g. applications 120A of service group 116A) include those applications or services that meet the dependencies and characteristics for that service group. Thus, for service group 116B, applications 120B would include those applications that met the dependencies 106B and characteristics 107B described in the manifest 115. The manifest may include dependencies and characteristics for substantially any number of different service groups.

Examples of application characteristics may include computational power specifications for an application. An administrator or application developer can group applications together into a service group based on the application's need (high or low) for processing power. Application characteristics may also include the nature of the clients accessing the application. For instance, if one application is accessed by a very large number of people, it may be grouped in service group 116A, while applications that have a relatively low access rate may be grouped in service group 116B. Similarly, if an application needs to scale at a certain time (e.g. a shopping cart application near the Christmas holiday), it may be grouped with other such applications. Thus, an administrator, application developer, IT professional or other user may group applications using application characteristics. Moreover, if one application or service is dependent on another for input (or for any other reason), the user 105 may indicate that the application is dependent on the other, and that for one to run, both need to be running In this manner, a user may provide substantially any type of application characteristic or dependency on which a service group may be based. The characteristics and dependencies may also be time based, such that certain characteristics or dependencies only apply on weekdays or weekends, or during certain hours of the day.

In some embodiments, one of the application dependencies declaratively defined in the application manifest 115 may include an estimated workload for the application (e.g. an indication of the estimated amount of computing resources the application will consume). Another of the application dependencies declaratively defined in the application manifest may include desired (or required) hardware characteristics for hardware that runs the application(s). The service groups may include different application types including applications written in managed code (e.g. C#), or applications written in native code (e.g. C++), or both managed and native applications. The manifest 115 indicates whether the application is managed or native. Similarly, service groups may include both related and unrelated applications. Thus, as shown in FIG. 4, a single node (e.g. 431C) may include application A (instance 420A4), application B (instance 420B1) and application C (instance 420C2), each of which may be related or unrelated to the other. Application developers do not need to indicate or even think about which service group they are part of, as service groups can accommodate all types of applications.

The service groups allow the applications to be scaled over multiple different nodes in the distributed host computer system. For instance, as shown in FIG. 4, distributed host computer system 430 may include three nodes (more or fewer are possible), nodes 431A, 431B and 431C. Each node may run multiple applications or application instances. For example, node 431A is shown as running two applications (two instances of application A (420A1 and 420A2), and application B (instance 420B1)). Node 431B is shown as running instance 420A3 of application A and instance 420C1 of application C, while node 431C is shown as running instance 420A4 of application A, instance 420B2 of application B and instance 420C2 of application C. Accordingly, each node in the cluster 430 may run different instances of related or unrelated, managed or native applications. As will be explained below, applications within a particular service group may be co-located, as with application instances 420A1 and 420A2. These application instances will fail over and load balance together, and will be maintained on the same physical cluster node.

The application manifest 115 may also specify one or more runtime characteristics for applications that are part of a specified service group. The runtime characteristics may indicate how many instances of each application are to be started when the application is initialized. Moreover, the runtime characteristics may indicate the name of each application that is to be started. Other runtime characteristics may also be specified in the application manifest.

Method 200 next includes an act of sending the manifest to the distributed host computer system which loads those applications that fit the manifest criteria on one or more available nodes of the distributed host computer system according to the service groups specified in the manifest (act 230). For example, application manifest 115 may be sent to distributed host computer system 130 by computer system 101. The manifest indicates which applications are to be loaded into which service groups, and for those application instances that are to be co-located, the host system loads the instances on the same cluster node (e.g. 131C). Once the applications are initially loaded, the applications' lifecycles may be managed by the user 105. Thus, the applications may be upgraded, removed, taken down, modified or otherwise managed on the host system 130. Managing the applications may also include making a determination as to which nodes currently have available computing resources. Then, once those nodes have been located, the managing may include starting the applications on those nodes that are determined to have available processing resources. Administrators may similarly choose nodes based on other factors such as a history of very high uptime, an ability to handle large numbers network requests, a secure status indicating that the node is physically secured from attackers, or any number of other factors. Thus, in this manner, administrators may manage both the service groups and the lifecycle of applications within the groups.

FIG. 3 illustrates a flowchart of a an alternative method 300 for managing multiple different types of applications using service groups. The method 300 will now be described with frequent reference to the components and data of environment 100.

Method 300 includes an act of receiving an application manifest that declaratively defines application dependencies and characteristics for each of a plurality of different service groups, such that each service group includes applications that match the declaratively defined application dependencies and characteristics (act 310). For example, application manifest generating module 110 may receive application dependencies 106 and application characteristics 107 from user 105. Computer system 101 may determine which applications are to be loaded in a specified service group based on the manifest (act 320). Then, those applications that fit the manifest criteria for the specified service group are loaded on a single node of the distributed computer system according to the manifest 115, such that those applications that fit the manifest criteria for the specified service group are co-located (act 330). Accordingly, in one example, applications 120A of service group 116A are those applications that met dependencies 106A and characteristics 107A.

Applications that are dependent on each other may be co-located (e.g. co-located application instances 422 (420A1 and 420A2) of FIG. 4). These co-located application instances may fail over and be load balanced together. For instance, the distributed host computer system 430 may determine that an application instance (e.g. 420A1) in a specified service group (e.g. service group 116A) has failed on node 431A of the distributed host computer system. After this determination, both the application instance that failed (420A1) and those application instances that are co-located in the service group are migrated to a different node (e.g. 431B) on the distributed host computer system. This migration may occur automatically upon determining that a hardware or software failure has occurred and that at least one of a group of co-located application instances has gone down.

As mentioned above, service groups may include both related and unrelated applications, as well as managed or native applications. Service groups may also include stateless applications (applications that do not maintain state) and/or stateful applications (applications that do maintain state). When stateless applications are grouped in a service group, the application can simply be restarted on a new node without worrying about what the state was when the application originally went own. When stateful applications are grouped in a service group and an application goes down, the state is stored and migrated to the new node along with the application. When stateful applications are run, the application state may be replicated on multiple different nodes of the distributed host computer system 430 nodes. Applications or services that are part of the same service group may experience atomic system state transitions that implement all-or-nothing semantics amongst the service replicas or instances that part of the service group. Stateful services that are part of the same service group may share a common replicator. This enables amortization of the cost of replicator resources.

In some cases, applications in the same service group may communicate with each other using local name resolution. For example, if applications A and B of FIG. 4 are in same service group, application A communicates with B using local name resolution to determine which physical node of the distributed host computer system application B is located on. Applications A and B may then work together, for example, to commit any application changes that are made. These changes may be made in a manner which provides the traditional ACID guarantees. As such, atomic groups provide the atomicity and durability guarantees specified by ACID. In other embodiments, administrators or other users (e.g. 105) may use service groups to build application logic workflows. These workflows may specify which tasks are to be performed by which service groups, and in which order. As such, service groups may be implemented in conjunction with each other to perform a specified task or series of tasks in an workflow. Accordingly, methods, systems and computer program products are provided which manage multiple different types of applications using service groups.

The concepts and features described herein may be embodied in other specific forms without departing from their spirit or descriptive characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the disclosure is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope. 

We claim:
 1. A computer system comprising the following: one or more processors; system memory; one or more computer-readable storage media having stored thereon computer-executable instructions that, when executed by the one or more processors, causes the computing system to perform a method for managing multiple different types of applications using service groups, the method comprising the following: an act of receiving an indication of one or more application dependencies and characteristics that are to be implemented when an application is provisioned on a distributed host computer system; an act of creating an application manifest that declaratively defines application dependencies and characteristics for each of a plurality of different service groups, such that each service group includes applications that match the declaratively defined application dependencies and characteristics; and an act of sending the manifest to the distributed host computer system which loads those applications that fit the manifest criteria on one or more available nodes of the distributed host computer system according to the service groups specified in the manifest.
 2. The computer system of claim 1, further comprising an act of managing the lifecycle of those applications that were loaded into a specified service group based on the manifest criteria.
 3. The computer system of claim 1, wherein at least one of the service groups comprises a plurality of different application types.
 4. The computer system of claim 1, wherein the service groups allow the applications to be scaled over a plurality of nodes in the distributed host computer system.
 5. The computer system of claim 1, wherein application characteristics include computational power specifications for the application.
 6. The computer system of claim 1, wherein at least one service group includes both related and unrelated applications.
 7. The computer system of claim 1, wherein applications in same service group are co-located, such that the applications move together during load balancing or failover.
 8. The computer system of claim 1, wherein the application manifest further specifies one or more runtime characteristics for applications that are part of a specified service group.
 9. The computer system of claim 8, wherein at least one of the one or more runtime characteristics specifies how many instances of each application are to be started.
 10. The computer system of claim 8, wherein at least one of the one or more runtime characteristics specifies the name of each application that is to be started.
 11. The computer system of claim 1, wherein an application author indicates that two or more of the applications are to be co-located on the distributed host computer system.
 12. The computer system of claim 1, wherein at least one of the application dependencies declaratively defined in the application manifest comprises an estimated workload for the application.
 13. The computer system of claim 1, wherein at least one of the application dependencies declaratively defined in the application manifest comprises one or more hardware characteristics for hardware that runs the application.
 14. The computer system of claim 1, wherein loading the applications on the one or more available nodes of the distributed computer system comprises determining which nodes currently have available computing resources and starting the applications on the determined nodes.
 15. A distributed computer system comprising the following: one or more processors; system memory; one or more computer-readable storage media having stored thereon computer-executable instructions that, when executed by the one or more processors, causes the computing system to perform a method for managing multiple different types of applications using service groups, the method comprising the following: an act of receiving an application manifest that declaratively defines application dependencies and characteristics for each of a plurality of different service groups, such that each service group includes applications that match the declaratively defined application dependencies and characteristics; an act of determining which applications are to be loaded in a specified service group; and an act of loading those applications that fit the manifest criteria for the specified service group on a single node of the distributed computer system according to the manifest, such that those applications that fit the manifest criteria for the specified service group are co-located.
 16. The computer system of claim 15, further comprising: determining that an application instance in a specified service group has failed on at least one node of the distributed host computer system; and migrating both the application instance that failed and those applications that are co-located in the service group to a different node on the distributed host computer system.
 17. The computer system of claim 15, wherein at least one of the service groups comprises both stateless applications and stateful applications.
 18. The computer system of claim 17, wherein applications in the same service group communicate with each other using local name resolution.
 19. The computer system of claim 16, further comprising building one or more application logic workflows using service groups.
 20. A distributed computer system comprising the following: one or more processors; system memory; one or more computer-readable storage media having stored thereon computer-executable instructions that, when executed by the one or more processors, causes the computing system to perform a method for managing multiple different types of applications using service groups, the method comprising the following: an act of receiving an application manifest that declaratively defines application dependencies and characteristics for each of a plurality of different service groups, such that each service group includes applications that match the declaratively defined application dependencies and characteristics; an act of determining which applications are to be loaded in a specified service group; an act of loading those applications that fit the manifest criteria for the specified service group on a single node of the distributed computer system according to the manifest, such that those applications that fit the manifest criteria for the specified service group are co-located; determining that an application instance in a specified service group has failed on at least one node of the distributed host computer system; and migrating both the application instance that failed and those applications that are co-located in the service group to a different node on the distributed host computer system. 